Picture for Long Li

Long Li

Evaluating the Feasibility of Inferring Dietary Behavior Change Receptivity from Egocentric Images of Eating Environment

Add code
May 27, 2026
Viaarxiv icon

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

Add code
Apr 10, 2026
Viaarxiv icon

Leum-VL Technical Report

Add code
Mar 20, 2026
Viaarxiv icon

DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay

Add code
Mar 17, 2026
Viaarxiv icon

SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation

Add code
Mar 17, 2026
Viaarxiv icon

Anchored Policy Optimization: Mitigating Exploration Collapse Via Support-Constrained Rectification

Add code
Feb 05, 2026
Viaarxiv icon

IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck

Add code
Jan 09, 2026
Viaarxiv icon

Perception Before Reasoning: Two-Stage Reinforcement Learning for Visual Reasoning in Vision-Language Models

Add code
Sep 16, 2025
Viaarxiv icon

The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward

Add code
Sep 09, 2025
Figure 1 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 2 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 3 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Figure 4 for The Choice of Divergence: A Neglected Key to Mitigating Diversity Collapse in Reinforcement Learning with Verifiable Reward
Viaarxiv icon

VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning

Add code
Jul 30, 2025
Figure 1 for VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Figure 2 for VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Figure 3 for VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Figure 4 for VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning
Viaarxiv icon